Decision-Dependent Distributionally Robust Markov Decision Process Method in Dynamic Epidemic Control
نویسندگان
چکیده
In this paper, we present a Distributionally Robust Markov Decision Process (DRMDP) approach for addressing the dynamic epidemic control problem. The Susceptible-Exposed-Infectious-Recovered (SEIR) model is widely used to represent stochastic spread of infectious diseases, such as COVID-19. While Processes (MDP) offers mathematical framework identifying optimal actions, vaccination and transmission-reducing intervention, combat disease spreading according SEIR model. However, uncertainties in these scenarios demand more robust that less reliant on error-prone assumptions. primary objective our study introduce new DRMDP allows an ambiguous distribution transition dynamics. Specifically, consider worst-case probabilities within decision-dependent ambiguity set. To overcome computational complexities associated with policy determination, propose efficient Real-Time Dynamic Programming (RTDP) algorithm capable computing policies based reformulated accurate, timely, scalable manner. Comparative analysis against classic MDP demonstrates achieves lower proportion infections susceptibilities at reduced cost.
منابع مشابه
Distributionally Robust Markov Decision Processes
We consider Markov decision processes where the values of the parameters are uncertain. This uncertainty is described by a sequence of nested sets (that is, each set contains the previous one), each of which corresponds to a probabilistic guarantee for a different confidence level so that a set of admissible probability distributions of the unknown parameters is specified. This formulation mode...
متن کاملRobust partially observable Markov decision process
We seek to find the robust policy that maximizes the expected cumulative reward for the worst case when a partially observable Markov decision process (POMDP) has uncertain parameters whose values are only known to be in a given region. We prove that the robust value function, which represents the expected cumulative reward that can be obtained with the robust policy, is convex with respect to ...
متن کاملDistributionally Robust Optimization for Sequential Decision Making
The distributionally robust Markov Decision Process approach has been proposed in the literature, where the goal is to seek a distributionally robust policy that achieves the maximal expected total reward under the most adversarial joint distribution of uncertain parameters. In this paper, we study distributionally robust MDP where ambiguity sets for uncertain parameters are of a format that ca...
متن کاملRobust Markov Decision Processes
Markov decision processes (MDPs) are powerful tools for decision making in uncertain dynamicenvironments. However, the solutions of MDPs are of limited practical use due to their sensitivityto distributional model parameters, which are typically unknown and have to be estimated by thedecision maker. To counter the detrimental effects of estimation errors, we consider robust MDPs...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IISE transactions
سال: 2023
ISSN: ['2472-5854', '2472-5862']
DOI: https://doi.org/10.1080/24725854.2023.2219281